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Foreword 

This Technical Specification (TS) has been produced by ETSI 3rd Generation Partnership Project (3GPP). 

The present document may refer to technical specifications or reports using their 3GPP identities, UMTS identities or 
GSM identities. These should be interpreted as being references to the corresponding ETSI deliverables. 

The cross reference between GSM, UMTS, 3GPP and ETSI identities can be found under 
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Foreword 



rd , 



This Technical Specification has been produced by the 3 Generation Partnership Project (3GPP). 

The contents of the present document are subject to continuing work within the TSG and may change following formal 
TSG approval. Should the TSG modify the contents of the present document, it will be re-released by the TSG with an 
identifying change of release date and an increase in version number as follows: 

Version x.y.z 

where: 

X the first digit: 

1 presented to TSG for information; 

2 presented to TSG for approval; 

3 or greater indicates TSG approved document under change control. 

y the second digit is incremented for all changes of substance, i.e. technical enhancements, corrections, 
updates, etc. 

z the third digit is incremented when editorial only changes have been incorporated in the document. 
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Scope 



The present document contains an electronic copy of the ANSI-C code for the Floating-point Extended Adaptive Multi- 
Rate Wideband codec. Alternatively, fixed-point ANSI-C code is specified in 3GPP TS 26.273 [1]. The floating-point 
codec/encoder/decoder specified in this document or the fixed-point codec/encoder/decoder specified in [1] may be 
used depending on if the implementation platform is better suited for a floating-point or a fixed-point implementation. It 
has been verified that the fixed-point and floating-point codecs interoperate with each other without any artifacts. 

The floating-point ANSI-C code in the present document defines, besides the fixed-point c-code specified in [1], one 
valid reference implementation of the Extended Adaptive Multi-Rate Wideband transcoder (3GPP TS 26.290 [2]). 
Standard conformance is enforced by meeting the conformance criteria defined in [3]. 



References 



The following documents contain provisions which, through reference in this text, constitute provisions of the present 
document. 

• References are either specific (identified by date of publication, edition number, version number, etc.) or 
non-specific. 

• For a specific reference, subsequent revisions do not apply. 

• For a non-specific reference, the latest version applies. In the case of a reference to a 3GPP document (including 
a GSM document), a non-specific reference implicitly refers to the latest version of that document in the same 
Release as the present document. 

[1] 3GPP TS 26.273: "ANSI-C code for the Fixed-point Extended AMR Wideband codec". 

[2] 3GPP TS 26.290: " Audio codec processing functions; Extended AMR Wideband codec; 

Transcoding functions ". 

[3] 3GPP TS 26.274: " Audio codec processing functions; Extended Adaptive Multi-Rate - Wideband 

(AMR-WB-h) codec; Conformance testing ". 

[4] 3GPP TS 26.244: "Transparent end-to-end packet switched streaming service (PSS); 3GPP file 

format (3GP)" 



3 Definitions, symbols and abbreviations 

3.1 Definitions 

For the purposes of the present document, the terms and definitions are given in TS 26.290 [2]. 

3.2 Abbreviations 

For the purposes of the present document, the following abbreviations apply: 

AMR-WBh- Extended Adaptive Multi-Rate WideBand 

ANSI American National Standards Institute 

GSM Global System for Mobile communications 

I/O Input/Output 

RAM Random Access Memory 

ROM Read Only Memory 



£75/ 



3GPP TS 26.304 version 8.0.0 Release 8 6 ETSI TS 1 26 304 V8.0.0 (2009-01 ) 



C code structure 



This clause gives an overview of the structure of the C code and provides an overview of the contents and organization 
of the C code attached to the present document. 

The C code has been verified on the following systems: 

IBM PC/AT compatible computers with Windows 2000 SP4 and Microsoft Visual C++ v.6.0 compiler. 

ANSI-C was selected as the programming language because portability was desirable. 

4.1 Contents of the C source code 

The C code distribution has the files divided in five different directories, all present in the directory c-code. The 
directories are: common, decoder, encoder, lib_amr and include. The distributed files with suffix "c" contain the source 
code and the files with suffix "h" are the header files. 

Project and workspace files are provided in the directory MSVC. 

4.2 Program execution 

The Extended Adaptive Multi-Rate Wideband codec is implemented in two programs: 

(encoder) audio encoder; 

(decoder) audio decoder. 
The programs should be called like: 

encoder [encoder options] -if <audio input file> -of <parameter file>; 

decoder [decoder options] -if <parameter file> -of <audio output file>. 

The input files contain one or two channels of 16-bit linear encoded PCM audio samples stored in the wav file format 
and the parameter files contain encoded audio data and some additional flags. 

The encoder and decoder options will be explained by running the applications without input arguments. See the file 
readme.txt for more information on how to run the encoder and decoder programs. 

4.3 Code hierarchy 

Tables 1 and 2 are call graphs that show the functions used in the audio codec. 

Each column represents a call level and each cell a function. The functions contain calls to the functions in rightwards 
neighbouring cells. The time order in the call graphs is from the top downwards as the processing of a frame advances. 
All standard C functions: memcpyO, fwrite(), etc. have been omitted. The initialization of the static RAM (i.e. calling 
the _init functions) is also omitted. 
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Table 1 : Encoder call structure 



coder_amrwb_plus 
stereo 



decim 12k8 



hp50_12k8 



mix ch 



EUTILfpreemp 
h 



mvr2r 



coder If 



Interpol 



mvr2r 



E UTIL autocorr 



lag_wlnd 



E LPC lev dur 



E_LPC_a_lsp_con 
version 



int_lpc_npl 



flnd_wsp 



E_GAIN_lp_decim 
2 



E_GAIN_open_loo 
p_search 



E_GAIN_olag_me 
d 



E_LPG_lsp_lsf_co 
nverslon 



qplsf_2s_46b 



E_LPC_chebyshev 



E_LPG_f_lsp_a_c 
onverslon 



E_LPC_lsp_f_pol_ 
get 



E_LPG_a_welght 



E UTIL resldu 



E_UTIL_deemph 



E GAIN sort 



VQstagel (qplsf_ 
2s.c) 



sub_VQ{qplsf_2c.c 

} 
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isf2isp 



coder_acelp 



qpisf_2s_46b 



E_LPC_isf_reorder 
Plus 



E UTIL residu 



E_LPC_a_weight 



E_UTIL_deemph 



mvr2r 



set zero 



E_UTIL_synthesis 



E_UTIL_f_preemp 
h 



E_GAIN_closed_lo 
op_search 



pred_lt4 



E_UTIL_f_convolv 
e 



E__ACELP_xy1_co 
rr 



E_ACELP_codebo 
ok_target_update 



E_GAIN_f_pitch_s 
harpening 



E ACELP xh corr 



E ACELP 4t 



EGAINnormcor 
r 



EGAINnormcor 
rjnterpolate 



EUTILfconvolv 

8 



E_AGELP_h_vec_ 
corr1 



E_ACELP_ 
corr2 



h vec 



E_ACELP_ 
search 



.2pulse 



E_ACELP_quant_ 
4p_4N 



E_AGELP_quant_ 
1p_N1 



E_ACELP_quant_ 
3p_3N1 
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E ACELP quant_ 
2p 2N2 


E ACELP quant 
1p N1 


E ACELP quant 
4p_4N1 




E ACELP quant 
2p 2N2 


E ACELP quant 
2p 2N2 




E_ACELP_xy2_co 
rr 








q_gain2_plus 




segsnr 








coderjcx 




cos_window 






E_LPC_a_weight 




E_UTIL_residu 




E_UTIL_deemph 




fft9 




fft_rel 




adap low freq em 
Ph 






AVQ_cod 




RE8_PPV 






nearest neighbor 
2D8 




adap_lo_freq_dee 
mph 








ifft9 




ifft_rel 




get_gain 






q_gain_tcx 




E UTIL f preemp 
h 




E_UTIL_synthesis 




mvi2i 








coderhf 








EUTILautocorr 






lag_wind 




E_LPC_lev_dur 




E_LPC_a_isp_con 
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band_split_taligne 
d 2k 



coder stereo x 



version 



int_lpc_npl 



mvr2r 



E_LPC_isp_isf_co 
nversion 



q_isf_hf 



isf2isp 



match_gain_6k4 



int_gain 



E UTIL residu 



E_UTIL_synthesis 
Plus 



E_LPC_a_weight 



E_UTIL_residuPlu 
s 



q_gn_hf 



E_LPC_chebyshev 



E_LPC_f_isp_a_c 
onversion 



E_LPC_isp_fjDol_ 
get 



sub_VQ{q_isf_hf.c 
) 



E_LPC_isf_reorder 
Plus 



set zero 



E UTIL residu 



E_UTIL_synthesis 
Plus 



Interpol 



cod hi stereo 



mvr2r 



residu 



cholsolc 



smooth ener filter 



quantjilt 



pmsvq 



msvq 
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int2bin 



encjDrm 



cod tax stereo 



fir filt 



my_max 



quant_gain 



m cbcod 



min_msvq 



pmsvq 



msvq 



m cbcod 



set zero 



mvr2r 



ctcx stereo 



segsnr 



mvi2i 



AVQ encmux 



splitJdx_noovf 



cos window 



get_gain 



q_gain_pan 



fft3 



adap_low_freq_em 
ph 



AVQ cod 



adap_lo_freq_dee 
mph 



ifftS 



q_gain_tcx 



fft re! 



RES PPV 



nearest_neighbor_ 
2D8 



ifft re! 



sort(avq_cod.c) 
RES cod 



RES_vor 






reS_identify_absol 
ute 


Leader 


reS_coord 
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re8_k2y 






RE8_PPV 




nearest neighbo 
r 2D8 


re8_compute_bas 
ejndex 








re8_compute_rank 
_of_permutation_a 
nd s 






calc_bits 








writ_all_nq 








calc_bits 




writ_all_i 






init_pos_i_ovf 








chk_ovf 






chk_ovf 








writ_l 






writ_k 




writ_ovf 




int2bin 








unpack4bits 




int2bin 




encjDrm_stereo_x 








int2bin 




AVQencmux 




split_idx_noovf 




sort(avq_cod.c) 




RE8_cod 








RE8_vor 






re8_identify_absol 
ute 


Leader 




re8_coord 






re8_k2y 




RE8_PPV 




nearest neighbo 
r 2D8 


re8_compute_bas 
ejndex 








re8_compute_rank 
_of_pernnutation_a 
nd s 
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calc_bits 




writ_all_nq 






calc_bits 




writ_all_i 






init_pos_i_ovf 




chk_ovf 


chk_ovf 




writ_l 




writ_ovf 




writ_k 




unpack4bits_d 








int2bin 




encjDrm_hf 








int2bin 





Table 2: Decoder call structure 



decoder_am rwb_pl 
us 



bin2int 



decjDrm 



decjDrm_stereo_x 



bin2int 



pack4bits(dec_prm 
•c) 



bin2int 



bin2int 



pack4bits_d 



AVQ demuxdec 



bin2int 



read_all_nq(avq_d 
ec.c) 



read_all_i{avq_dec 
•c) 



read_nq(avq_dec. 
c) 



init_pos_i_ovf{avq 
dec.c) 



split_n(avq_dec.c) 



chk_ovf{avq_dec.c 



chk_ovf(avq_dec.c 

J 
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decjDrm_hf 



decoder_am rwb_pl 
us 1 



bin2int 



mvr2r 



decoder If 



RES dec 



read_l(avq_dec.c) 



read_ovf(avq_dec. 
c) 



read_k(avq_dec.c) 



re8_decode_base 
index 



re8_k2y 



re8_decode_rank_ 
of_permutation 



RES PPV 



nearest_neighbor_ 
2D8 



mvr2r 



qpisf_2s_46b 



isf2isp 



int_lpc_npl 



decoder tax 



E_LPC_isf_reorder 
Plus 



E_LPC_f_isp_a_c 
onversion 



E_LPC_isp_f_pol_ 
get 



AVQ demuxdec 



read_all_nq(avq_d 
ec.c) 



read_al IJ (avq_dec 
■c) 



read_nq{avq_dec. 




initjDOS_i_ovf(avq 
_dec.c) 



Split_n(avq_dec.c) 



chk_ovf(avq_dec.c 
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decoderacelp 



set zero 



rnd_ph16 



adap_lo_freq_dee 
mph 



mvr2r 



find_mpitch{dec_tc 
x.c) 



ifft9 



d_gain_tcx 



cos window 



EUTILfpreemp 
h 



E_LPC_a_weight 



EUTILsynthesis 



E UTIL residu 



RE8 dec 



chk_ovf(avq_dec.c 
} 



read_l(avq_dec.c) 



read_k{avq_dec.c) 



read_ovf(avq_dec. 



re8_decode_base 
index 



re8_l<2y 



re8_decode_rank_ 
of_permutation 



RE8 PPV 



nearest_neighbor_ 
2D8 



E UTIL random 



ifft rel 



pred_lt4 



D_ACELP_decode 
4t 



D_ACELP_decode 
_4p_4N 



D_ACELP_decode 
1p_N1 



D_ACELP_decode 
_3p_3N1 
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D ACELP decode 
2p 2N1 


D ACELP decode 
1p N1 


D ACELP decode 
2p 2N1 




D ACELP decode 
4p 4N1 




D ACELP decode 
2p 2N1 


D_ACELP_add_pu 
Ise 




E UTIL f preemp 
h 






E_GAIN_f_pitch_s 
harpening 




d_gain2jDlus 




EUTILsynthesis 




E_LPC_a_weight 




E_UTIL_residu 




E_UTIL_deemph 




mvr2r 




set_zero 




E_UTIL_deemph 








bass_postfilter 




mvr2r 




short_pitch_tracker 
(bass pf.c) 




decoder_hf 






d_isf_hf 






E LPC isf reorder 
Plus 




isf2isp 






intjpc_npl 


E_LPC_f_isp_a_c 
onversion 






E_LPC_isp_f_pol_ 
get 




mvr2r 






match_gain_6k4 






set_zero 




E_UTIL_residu 
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decoder stereo x 



delay 



oversamp_12k8 



int_gain 



d_gain_hf 



soft exc hf 



E_UTIL_synthesis 
Plus 



smooth ener hf 



Interpol 



mvr2r 



band_split_tallgne 
d 2k 



dec tcx stereo 



dec hi stereo 



Interpol 



dtcx stereo 



my_min 



my_max 



dec flit 



dec_gain 



mvr2r 
resldu 
firjilt 
syn_fllt 



EUTILsynthesIs 
Plus 



cos window 



adap_lo_freq_dee 
mph 



ifftS 



d_gain_tcx 



crosscorr 



glev_s 



ifft rel 



pmsvqjnv 



msvqjnv 



pmsvqjnv 



msvqjnv 
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delay 




bandJoin_2k 




Interpol 


hp50_12k8 






oversamp_1 2k8 




Interpol 
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4.4 



Variables, constants and tables 



4.4.1 Description of fixed tables used in the C-code 

This clause contains a listing of all fixed tables declared in tables_plus.c and tables_stereo.c files. 

Table 3: Encoder fixed tables 



Format 


Table name 


Size 


Description 


Float32 


NBITS CORE 


8 


Gore bit-rates 


Float32 


T sin 


1152 


FFT Sine table 


Float32 


T cos 


1152 


FFT Cosine table 


Float32 


filter 32k 


61 


FIR table for decimation/oversampling 


Float32 


filter 32k hf 


61 


FIR table for decimation/oversampling 


Float32 


filter 32k 7k 


61 


FIR table for decimation/oversampling 


Float32 


filter 48k 


185 


FIR table for decimation/oversampling 


Float32 


Filter 48k hf 


185 


FIR table for decimation/oversampling 


Float32 


filter 8k 


61 


FIR table for decimation/oversampling 


Float32 


isf init 


16 


Initial ISF memory 


Float32 


Mean isf 


16 


Means of ISFs 


Float32 


Died isf 


2304 


1 St stage codebook, isfO to isf8 


Float32 


Dico2 isf 


1792 


1 St stage codebook, isf9 to isf 1 5 


Float32 


Dico21 isf 


192 


2nd stage codebook, isf2 to isf 2 2 


Float32 


Dico22 isf 


384 


2nd stage codebook, isf2_3 to isf 2_5 


Float32 


Dico23 isf 


384 


2nd stage codebook, isf2 6 to isf 2 8 


Float32 


Dico24 isf 


96 


2nd stage codebook, isf2_9 to isf 21 1 


Float32 


Dico25 isf 


128 


2nd stage codebook, isf2_12 to isf 2_15 


Float32 


Dico21 isf 36b 


640 


1st stage codebook, (36b) split 1 


Float32 


Dico22 isf 36b 


512 


1st stage codebook, (36b) split 2 


Float32 


Dico23 isf 36b 


448 


1st stage codebook, (36b) split 3 


Float32 


Dico_gain_hf 


512 


Quantization table for one-stage HF gain 


Float32 


Mean isf hf 12k8 


8 


Means of ISFs (full band) 


Float32 


dico1 isf hf 12k8 


32 


1 nd stage isf codebook (full band) 


Float32 


mean isf hf low rate 


8 


Means of isfs 


Float32 


Dico1 isf hf low rate 


32 


1 St stage isf codebook 


Float32 


dico2 isf hf 


1024 


2nd stage isf codebook 


Float32 


Lag window 


17 


Lag window 


Float32 


Filtjp 


13 


Low-pass fir filter for bass post filter 


Float32 


Sin20 


20 


Random phase 


Float32 


Inter4 2 


65 


Va resolution interpolation filter 


Float32 


VadFiltBandFreqs 


12 


Open-loop classifier 


Float32 


Bw 


12 


Open-loop classifier 


Float32 


Lwg 


8 


Open-loop claissifier 


Float32 


Gain_hf_ramp 


64 


HF gain ramp for wb->wb-i- switching 


Float32 


Inter2 coef 


12 


Filter coefficients for band join/split 


Float32 


Filter LP180 


2341 


Filter for 48 kHz interpolation 


Float32 


StereoNbits 


18 


Stereo bit-rates 


Float32 


Filter 2k 


321 


2k decimation filter 


Float32 


Cb flit hi mean 


9 


Average filter 


Float32 


Flit hi mscb4a 


16*9 




Float32 


Flit hi mscb 7a 


16*9 




Float32 


Flit hi mscb 7b 


8*9 




Float32 


Cb_gain_hi_mean 


2 


Average gain vector 


Float32 
Float32 


Gain_hi_mscb_2a 

Gain_hi_mscb_5a 

TBC 


4*2 
32*2 
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Table 4: Decoder fixed tables 



Format Table name Size Description 



Same as encoder 



4.4.2 Static variables used in tine C-code 

In this clause two tables that specify the static variables for the encoder and decoder respectively are shown. All static 
variables are declared within a C struct. 
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Table 5: Encoder static variables 



struct name 


type 


variable 


size 


description 


Coder StState 












float 


mem decim 


1608 


speech decimated filter memory 




int 


decim frac 


1 


Fractional decimation factor 




float 


mem sig in 


4 


hp filter memory 




float 


mem preemph 


1 


speech preemphasis filter mem 




float 


mem decim hf 


46 


HF filter memory 




float 


old speech hf 


528 


HFold speech vector 




float 


past q isf hf 


8 


HF past quantized isf 




float 


ispold hf 


8 


HFold isp 




float 


ispold q hf 


8 


HF quantized old isp 




float 


old gain; 


1 


HFold gain match 




float 


mem hf1 


8 


HF memory for gain 1 




float 


mem hf2 


8 


HF memory for gain 2 




float 


mem hf3 


8 


HF memory for gain 3 




float 


old exc 


375 


old excitation 




float* 


mean isf hf 


1 


isf codebook mean 




float* 


died isf hf 


1 


isf codebook first stage 


Coder State Plus 












Coder StState 


left 


2614 


state for left channel 




Coder StState 


right 


2614 


state for right channel 




float 


old Chan 


528 


old left signal 




float 


old Chan 2k 


140 


old left signal 2kHz sampl. rate 




float 


old Chan hi 


448 


old left signal HB 




float 


old speech 2k 


140 


old mono signal 2kHz sampl. 
rate 




float 


old speech hi 


448 


old mono signal HB 




float 


old speech pe 


528 


past pre-emphasised mono 




float 


old wh 


9 


past weighted filter 




float 


old wh q 


9 


past quantized weighted filter 




float 


old gm gain 


2 


past gain matching 




float 


old exc mono 


9 


past mono excitation 




float 


tilt energy threshold 


1 


filter energy thershold 




float 


w window 


64 


weighting window 




PMSVQ* 


*filt_hi pmsvq 




MSVQ quantizer 




PMSVQ* 


*gain hi pmsvq 




IVISVQ quantizer 




int 


mem stereo ovip size 




past stereo overlap size 




float 


mem stereo ovIp 


32 


past stereo overlap 




NCLASSDATA 


*stClass 




use case B classifier 




VadVars 


*vadSt 




VAD state 




short 


vad hist 




VAD history 




float 


old speech 


528 


old speech 




float 


old synth 


16 


synthesis memory 




float 


past isfq 


16 


past isf quantizer 




float 


old wovlp 


128 


last tcx overlap 




float 


old d wsp 


187 


Weighted speech vector 




float 


old exc 


392 


old excitation vector 




float 


old mem wsyn 




weighted synthesis memory 




float 


old mem wO 




weighted speech memory 




float 


old_mem_xnq 




quantized target memory 
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int 


old ovip size 


1 


last tcx overlap size 




float 


isfold 


16 


old isf frequency domain 




float 


ispold 


16 


old isp 




float 


ispold q 


16 


quantized old isp 




float 


mem wsp 


1 


wsp vector mem 




float 


mem Ip decim2 


3 


wsp decimator filter mem 




float 


ada w 


1 


open loop LTP 




float 


ol gain 


1 


open loop LTP 




short 


ol wght fig 


1 


open loop LTP 




long int 


old ol lag 


5 


past openloop lag 




int 


old TO med 


1 


past pitch 




float 


hp old wsp 


699 


past HP weighted speech 




float 


hp ol Itp mem 


7 


past HP openloop long term 
prediction 




float 


window 


512 


LP analysis window 




short 


SwitchFlagPlusToWB 


1 


flag for switching to AIVIR-WB 




float 


mem gain code 


4 


past code gain 




short 


prev_mod 


1 


past frame type 
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Table 6: Decoder static variables 



struct name 


type 


variable 


size 


description 


Decoder StState 












float 


mem oversamp 


72 


Memory oversampling 




int 


over frac 


1 


Fractional overcloking factor 




float 


mem oversamp hf 


24 


memory 




float 


past q isf hf 


8 


HF past quantized isf 




float 


past q isf hf other 


8 


HF past quantized isf for the other 
channel when mono decoding stereo 




float 


past q gain hf 


1 


HF past quantized gain 




float 


past q gain hf other 


1 


HF past quantized gain for the other 
channel when mono decoding stereo 




float 


old gain 


1 


HF old gain match 




float 


ispold hf 


8 


HFold isp 




float 


threshold; 


1 


HF memory for smooth ener 




float 


mem syn hf 


8 


HF synthesis memory 




float 


mem d tcx 


96 


delay compensation memory 




float 


mem d none 


64 


Non causality delay 




float 


mem synth hi 


16 


High band sunthesis memory 




float 


mem sig out 


4 


hp filter memory 




float 


old synth hf 


512 


synch delay memory 




float 


Ip amp 


1 


memory for soft exc 




float* 


mean isf hf 


1 


isf codebook mean 




float* 


dico1 isf hf 


1 


isf codebook first stage 


Decoder State Plus 












Decoder StState 


left 


828 


State for left channel 




Decoder StState 


right 


828 


State for right channel 




float 


mem left 2k 


20 


2kHz memory on left chan 




float 


mem right 2k 


20 


2kHz memory on right chan 




float 


mem left hi 


64 


HB memory left channel 




float 


mem right hi 


64 


HB memory right channel 




float 


my old synth 2k 


35 


old 2kHz synthesis 




float 


my old synth hi 


128 


old HB synthesis 




float 


my old synth 


148 


old stereo synth 




float 


old AqLF 


85 


old quantized LPC 




float 


old wh 


9 


old decoded filter 




float 


old wh2 


9 


old decoded filter 2 




float 


old exc mono 


9 


old mono excitation 




float 


old gain left 


4 


old gain on left chan 




float 


old gain right 


4 


old gain on right chan 




float 


old wh q 


9 


past quantized filter 




float 


old gm gain 


2 


past gain matching 




float 


w window 


64 


weighted synthesis window 




PMSVQ 


*filt hi pmsvq 


1 


past MSVQ filter 




PMSVQ 


*gain hi pmsvq 


1 


past IVISVQ gain 




int 


mem stereo ovip size 


1 


past stereo overlap size 




float 


mem stereo ovip 


32 


past stereo overlap 




int 


last stereo mode 


1 


past stereo mode 




float 


side rms 


1 


side signal RMS 




float 


h 


9 


current filter 




float 


mem balance 


1 


past balance factor 
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int 


fer hist 


500 


frame errasure history 




int 


fer hist ptr 




frame erasure pointer 




float 


fer mean 




frame erasure mean 




float 


old xri 


1148 


old spectral coefficeints 




int 


last mode 




last mode in previous 80ms frame 




float 


mem sig out 


4 


hp50 filter memory for synthesis 




float 


mem deemph 




speech deemph filter memory 




int 


prev Ipc lost 




previous Ipc is lost when = 1 




float 


old synth 


16 


synthesis memory 




float 


old exc 


392 


old excitation vector 




float 


isfold 


16 


old isf (frequency domain) 




float 


ispold 


16 


old isp (immittance spectral pairs) 




float 


past isfq 


16 


past isf quantizer 




float 


wovlp 


128 


last weighted synthesis for overlap 




int 


ovip size 




overlap size 




float 


isf buf 


51 


old isf (for frame recovery) 




int 


old TO 




old pitch value (for frame recovery) 




int 


old TO frac 




old pitch value (for frame recovery) 




short 


seed ace 




seed memory (for random function) 




float 


mem wsyn 




TCX synthesis memory 




short 


seed tcx 




seed memory (for random function) 




float 


wsyn rms 




rms value of weighted synthesis 




float 


past gpit 




past gain of pitch (for frame recovery) 




float 


past gcode 




past gain of code (for frame recovery) 




int 


pitch tcx 




for bfi 




float 


gc threshold 




GC threshold 




float 


old synth pf 


503 


Bass post-filter: old synthesis 




float 


old noise pf 


24 


bass post-filter: noise memory 




int 


old T pf 


2 


bass post-filter: old pitch 




float 


old gain pf 


2 


Bass post-filter: old pitch gain 




float 


*mean isf hf 


1 


HF isf codebook in-use 




float 


*dico1 isf hf 


1 


HF isf codebook in-use 




float 


mem gain code 


4 


past code gain 




float 


mem Ipc hf 


9 


past HF Ipc filter 




float 


mem gain hf 


1 


past HF gain 




short 


ramp state 


1 


ramp state 





5 File formats 

This clause describes the file formats used by the encoder and decoder programs. 

5.1 Audio file (encoder input/decoder output) 

Audio files read by the encoder must be formatted as 16 bits PCM wave (*.wav) files. The decoder output is written as 
a 16 bit PCM wave file (*.wav). 

Note that the decoder, with proper command line switch, can produce a mono file from a stereo bit-stream. 
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5.2 Parameter bitstream file (encoder output/decoder input) 

For AMR-WB+ operation, the files produced by the audio encoder/expected by the audio decoder are either according 
to the raw format defined in Reference [2] Section 8.2, or according to the 3GP file format [4], whereby the storage 
sample definition is found in Reference [2] Section 8.3. 
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Annex A (informative): 
AI\/IR-WB+ user guide 



This Annex contains a user guide on how to configure AMR-WB+ codec parameters. Due to the codec flexibihty, 
different configurations can be used to operate the codec at a certain bit rate. The aim of this annex is ease the 
understanding of how to set the parameters of the AMR-WB+ audio codec. 

There are two ways to set the AMR-WB+ parameters: in simple mode and flexible ('expert') mode. The simple mode 
uses a predefined set of configurations at different bit rates. In the flexible mode, the user chooses the core bit rate and 
the stereo extension rate, in case of stereo operation, and the internal sampling frequency, to attain a certain bit rate. 

The AMR-WB+ audio codec processes input frames always equal to 2048 samples at an internal sampling frequency Fs. 
The internal sampling frequency (ISF) is independent from the audio sampling rate and is limited to the range from 
12800-38400 Hz. The 2048-sample frames are split into two critically sampled equal frequency bands. This results in 
two superframes of 1024 samples corresponding to the low frequency (LF) and high frequency (HF) bands. Each 
superframe is divided into four 256-sample frames. Because the codec always requires 2048 samples then the input 
frame size or frame duration will vary for different ISFs. 



A.1 Encoder usage 
A. 1.1 Simple mode 

Simple mode is easy to use and requires no knowledge of AMR-WB+ to use the full capacity of the codec. The usage is 
as follows: 



AmrwbPlusEncode -rate <bit rate> [-mono] [-ff <3gp|raw>] -if <infile.wav> - 
of <outf ile . wb+> 

Where 

AmrwbPlusEncode Name of the AMR-WB+ encoder program either compiled from the floating-point C- 
code of this specification or from the fixed-point C-code of [1]. 

- rate Bit rate from 6-36 kbps for mono encoding or 7-48 kbps for stereo encoding 
-mono Forces mono encoding for stereo inputs. 

If this option is not used the encoder performs mono encoding for mono WAV files and 
stereo encoding for stereo WAV files. 

- f f File format: 

3gP 
raw 
The default is 3gp file format. 

-if Input audio WAV file with one (mono) or two (stereo) channels 

Supported audio sampHng rates are 8, 16, 24, 32, 48, 11.025, 22.05, 44.1 kHz 

-of Output file (according to the -ff argument) 



The codec will use the best combination of mono and stereo bit rates and internal sampling frequency (ISF) according 
to the bit rate specified by the user. In this simple mode, the codec uses a set of predefined configurations in the bit rate 
range from 6-48 kbps. The codec will choose the closest configuration to the required bit rates. The configurations have 
been chosen to optimize the quality/bandwidth trade-off at a certain bit rate for 48 kHz sampled input. 

Tables A. 1 and A. 2 show the default codec configurations for mono and stereo operation, respectively. In the tables, 
the mapping from selected bit rate to mode index and ISF index are given, and in addition the resulting bit rate factor 
and coded audio bandwidth (BW). The mode index (or frame type) is defined and explained in Table 25 of 26.290 [2] 
(modes 16 to 23 are mono modes and modes 24 to 47 are stereo modes). The mapping from ISF index to ISF, 
corresponding frame size and bit rate factor is defined in Table 24 of 26.290 [2]. 
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Table A.1 Mono default configurations 



bit rate 


Mode Index (bit rate at 


ISF index (sampling 


Bit rate 


BW 


(kbps) 


nominal ISF of 25.6 kHz) 


frequency) 


factor 


(kHz) 


5.85 


16 (10.4 kbps) 


2 


(14.4 kHz) 


9/16 


7.2 


6.933 


16 (10.4 kbps) 


4 


(17.067) 


2/3 


8.533 


7.8 


16 (10.4 kbps) 


5 


(19.2 kHz) 


3/4 


9.6 


8.667 


16 (10.4 kbps) 


6 


(21.33 kHz) 


5/6 


10.67 


9.75 


16 (10.4 kbps) 


7 


(24 kHz) 


15/16 


12 


11.25 


17 (12 kbps) 


7 


(24 kHz) 


15/16 


12 


12 


17 (12 kbps) 


8 


(25.6 kHz) 


1 


12.8 


13.6 


18 (13.6 kbps) 


8 


(25.6 kHz) 


1 


12.8 


15.2 


19 (15.2 kbps) 


8 


(25.6 kHz) 


1 


12.8 


17.1 


19 (15.2 kbps) 


9 


(28.8 kHz) 


9/8 


14.4 


18.9 


20 (16.8 kbps) 


9 


(28.8 kHz) 


9/8 


14.4 


21.6 


21 (19.2 kbps) 


9 


(28.8 kHz) 


9/8 


14.4 


24 


21 (19.2 kbps) 


10 


(32 kHz) 


5/4 


16 


26 


22 (20.8 kbps) 


10 


(32 kHz) 


5/4 


16 


30 


23 (24 kbps) 


10 


(32 kHz) 


5/4 


16 


32 


23 (24 kbps) 


11 


(34.13 kHz) 


4/3 


17.06 


33.75 


23 (24 kbps) 


12 


(36 kHz) 


45/32 


18 


36 


23 (24 kbps) 


13 


(38.4 kHz) 


3/2 


19.2 




Table A.2: 


Stereo default configurations 


bit rate 


Mode Index (bit rate at 


ISF index (sampling 


Bit rate 


BW 


(kbps) 


nominal ISF of 25.6 kHz) 


frequency) 


factor 


(kHz) 


6.975 


24 (12.4 kbps) 


2 


(14.4 kHz) 


9/16 


7.2 


8.267 


24 (12.4 kbps) 


4 


(17.067 kHz) 


2/3 


8.533 


9.3 


24 (12.4 kbps) 


5 


(19.2 kHz) 


3/4 


9.6 


10.33 


24 (12.4 kbps) 


6 


(21.33 kHz) 


5/6 


10.67 


11.65 


24 (12.4 kbps) 


7 


(24 kHz) 


15/16 


12 


13.125 


26 (14 kbps) 


7 


(24 kHz) 


15/16 


12 


14.25 


28 (15.2 kbps) 


7 


(24 kHz) 


15/16 


12 


15 


29 (16 kbps) 


7 


(24 kHz) 


15/16 


12 


16 


29 (16 kbps) 


8 


(25.6 kHz) 


1 


12.8 


17.2 


31 (17.2 kbps) 


8 


(25.6 kHz) 


1 


12.8 


18 


32 (18 kbps) 


8 


(25.6 kHz) 


1 


12.8 


19.2 


34 (19.2 kbps) 


8 


(25.6 kHz) 


1 


12.8 


20 


35 (20 kbps) 


8 


(25.6 kHz) 


1 


12.8 


22.5 


35 (20 kbps) 


9 


(28.8 kHz) 


9/8 


14.4 


23.85 


37 (21.2 kbps) 


9 


(28.8 kHz) 


9/8 


14.4 


25.2 


38 (22.4 kbps) 


9 


(28.8 kHz) 


9/8 


14.4 


27 


40 (24 kbps) 


9 


(28.8 kHz) 


9/8 


14.4 


30 


40 (24 kbps) 


10 


(32 kHz) 


5/4 


16 


32 


40 (24 kbps) 


11 


(34.13 kHz) 


4/3 


17.06 


34.13 


41 (25.6 kbps) 


11 


(34.13 kHz) 


4/3 


17.06 


36 


41 (25.6 kbps) 


12 


(36 kHz) 


45/32 


18 


37.6875 


43 (26.8 kbps) 


12 


(36 kHz) 


45/32 


18 


40.2 


43 (26.8 kbps) 


13 


(38.4 kHz) 


3/2 


19.2 


43.2 


44 (28.8 kbps) 


13 


(38.4 kHz) 


3/2 


19.2 


45 


46 (30 kbps) 


13 


(38.4 kHz) 


3/2 


19.2 


48 


47 (32 kbps) 


13 


(38.4 kHz) 


3/2 


19.2 



A. 1.2 Flexible mode 

Flexible ('expert') mode requires more knowledge of AMR-WB+ to use the full capacity of the codec. In this mode, the 
user can choose the core bit rate and the stereo extension rate, in case of stereo operation, and the internal sampling 
frequency, to attain a certain bit rate. 

The usage is as follows: 
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AmrwbPlusEncode -mi <Tnode> [-isf <factor>] [-Ic] [-dtx] [-ff <3gp/raw>] 
-if <infile.wav> -of <outf ile . wb+> [-cf <conf igf ile . txt>] 

Where 

AmrwbPlusEncode Name of the AMR-WB+ encoder program either compiled from the floating-point 

C-code of this specification or from the fixed-point C-code of [1]. 
-mi Mode Index 

0..8 AMR-WB 

10.. 13 AMR-WBh- special modes 

16.. 23 AMR-WBh- mono modes 

24.. 47 AMR-WBh- stereo modes 

-isf Internal Sampling Frequency factor 

0.5. .1.5 
If this option is missing, the default is 1.0 

- Ic Low complexity (for AMR-WB+ modes) 

If this option is missing, the default is to use normal encoding. 

Note: This option is designed for terminal-based encoding. For optimal quality, it 

is recommended to not use this parameter. 

- dtx Enables V AD/DTX functionality (only for AMR-WB modes) 

If this option is missing, the default is NO DTX 

- f f File format: 

3gP 
raw 
If this option is missing, the default is 3gp file format. 

-if Input audio WAV file 

Supported audio sampHng rates are 8, 16, 24, 32, 48, 1 1.025, 22.05, 44. 1 kHz 
Modes 0..8 require 16 kHz input audio sampling rate. 
Modes 10.. 13 require 16 or 24 kHz audio sampling rate. 

- of Output file (according to the -ff argument) 

- cf Configuration file: an auxiliary file that can be used for bit rate switching 



The mode index (mi) can be found in Tables 21 and 25 of 26.290 [2] and the Internal Sampling Frequency (ISF) in 
Table 24 of 26.290 [2]. 

Table 21 of 26.290 [2] contains the AMR-WB-compatible modes and four AMR-WB+ special modes (mode index 10- 
13). The AMR-WB+ special modes have a fixed ISF (ISF index = from Table 24 of 26.290 [2]). The codec can switch 
dynamically between the AMR-WB and AMR-WB+ special modes (Table 21 of 26.290 [2]) if AMR-WB+ is operated 
at 16 kHz. 

Table 25 of 26.290 [2] contains the AMR-WB+ mono and stereo modes. The core and stereo mode indices of Table 25 
of 26.290 [2] correspond to Tables 22 and 23 of 26.290 [2]. The bit rates specified in Table 25 of 26.290 [2] are for a 
nominal ISF of 25600 Hz (bit rate factor = 1.0). The output bit rate can be computed by multiplying the bit rate value 
from Table 25 of 26.290 [2] and the bit rate factor from Table 24 of 26.290 [2]. 



Stereo flexibility 

In case of stereo operation, the flexible mode provides some degree of flexibility for trade-off between mono and stereo 
extension bit rates. This can be content dependent where higher or lower stereo extension bit rates can be used 
depending on the correlation between the two channels. In Table 25 of 26.290 [2], there are 24 stereo modes (mode 
indices 24 to 47). These modes correspond to the 8 core mono modes where 3 different extension rates are combined 
with each core mode. For example, mode indices 39, 40, and 41 correspond to a core bit rate of 19.2 kbps combined 
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with stereo extension rates of 4.0, 4.8, and 6.4 kbps, respectively. This results in total bit rates of 23.2, 24.0, and 25.6 
kbps. In these stereo modes, the ratio between the stereo extension rate and the total bit rate is 17.1%, 20%, and 25%, 
respectively. 



Choosing encoded bandwidth 

The flexibility in choosing the ISF gives the user the choice to adjust the coded audio bandwidth depending on the input 
signal. For instance, in case of speech signals, a bandwidth up to 14 kHz is sufficient to attain transparent quality. 

To determine the AMR-WB+ mode, one approach could be to first determine the bandwidth of the signal that required 
to be encoded. Table 24 of 26.290 [2] can be used to choose the ISF. Then the core and stereo rates are chosen from 
Tables 25, 22, and 23 of 26.290 [2]. (Note that these bit rates should be scaled with the bit rate factor from Table 24 of 
26.290 [2].) 

Further tuning can be done by adjusting the ISF. The HF encoding uses relatively few bits compared to the LF; 
therefore, the LF part of the signal has a higher definition than the HF. Therefore, increasing the ISF can be considered 
even in cases where the resulting bandwidth might exceed the input signal bandwidth if the bit budget allows it. 

The graph below explains the three different possibilities. 



Amplitude 



Amplitude 



Amplitude 




Case A 



CaseB 



CaseC 



Frequency 



Case A is when the ISF is set to be smaller than the signal spectrum. This can be used when the bit rate or the CPU load 
has to be reduced. 
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Case B is probably the most usual situation. The signal spectrum is matched with the ISF. 

Case C can be used when high quality is required and an adequate bit rate budget is available. In this case, the HF 
exceeds the signal bandwidth, but the bits allocated for the HF will be used to encode the active part only. For example, 
this case can be used to encode a signal with input bandwidth limited to 14 kHz at a bit rate of 36 kbps. If we use mode 
index 36 (24 kbps at nominal ISF) and ISF of 38.4 kHz (bit rate factor 1.5), the resulting bit rate will be 36 kbps with 
the LF encoded up to 9.6 kHz and the HF from 9.6 to 14 kHz. 



Bit rate switching using a configuration file 

Bit rate switching can be simulated using an auxiliary configuration file. The option -cf refers to a text file that allows 
for changing Mode Index and ISF dynamically during a program run. The configuration file contains a time reference, a 
specific extension (AMR-WB or AMR-WB+), mode index (mi), and an internal sampling frequency (ISF), used to 
encode at that specific time. The encoder keeps the last setting to encode the remaining part of the file. To use -cf option 
to switch between AMR-WB and AMR-WBh- special modes, input files at 16 kHz sampling rate need to be used and 
the decoder needs to use the option -fs 16000 (see Section A.2 below). 

Each configuration file consists of 4 columns consisting of "time" "ext" "mode_index" "fscale". 

"time" is specified in seconds and must always be > 0. 

"extension" is or 1 for choosing AMR-WB or AMR-WBh- modes. 

"mi" = [0..47], where [0..15] is AMR-WB and AMR-WBh- special modes, and [16..47] is for AMR-WB+ extension 
modes 

"isf"= [0.5. .1.5] for AMR-WB+ extension modes, and represent the bit rate factor, 'isf is set to zero for mode indices 
to 15. 

The following is an example of a configuration file for switching between AMR-WB and AMR-WB+ special modes. 



#time 


Extension 


mi 


isf 


0.08 


1 


10 


0.0 


1.08 





7 


0.0 


2.08 


1 


10 


0.0 


3.08 








0.0 



Here, 'time' is in seconds, and 'extension'=0 means AMR-WB and 'extension'=l means AMR-WB+. The value of 'isf 
for modes 0-15 must be zero. In this example, the encoder will use the initial configuration up to first 80 ms. At 80 ms it 
will start encoding with AMR-WB+ mode 10 (13.6 kbps). At time instant 1080 ms, it will start encoding with AMR- 
WB mode 7 (23.05 kbps). At time instant 2080 ms, it will start encoding with AMR-WB+ mode 10. Finally, at time 
instant 3080 ms, it will start encoding with AMR-WB mode (6.6 kbps) till the end of the file. This can be seen as 
using the initial configuration for the first 80 ms, using mode 10 for 1 second, using mode 7 for 1 second, using mode 
10 for 1 second, and using mode for the remaining of the file. 



In the above example, if the following command line is used: 

AmrwbPlusEncode -ff raw -mi 12 -cf switch_amrwb.txt -if Input.wav -of bit_stream 
then the first 80 ms will be encoded with AMR-WB+ mode 12. 

The example below shows a configuration file where mode index and ISF are switched. 



#time Extension mi isf 

0.5 1 16 1.0 
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1.0 


I 20 


1.5 


2.0 


I 23 


1.5 


3.0 


[ 35 


0.8 


4.0 


[ 40 


1.0 


5.0 


I 47 


1.5 


10.0 


I 23 


1.0 



A.2 Decoder usage 

The decoder usage is as follows: 



AmrwbPlusDecode [-ff <3gp/raw>] [-fs <fs_Hz>] [-mono] [-limiter] -if 
<infile.wb+> -of <outf ile . wav> [-fer <error.txt>] 

Where 

AmrwbPlusDecode 



-ff 



-fs 


-mono 


-limiter 


-if 


-of 


-fer 



Name of the AMR-WB+ decoder executable either compiled from the floating-point C- 
code of this specification or from the fixed-point C-code of [1]. 

File format: 

3gP 

raw 
The default is 3gp file format. 

Output sampling rate (if this option is missing the default value of 48 kHz is used) 
Forces mono decoding for stereo encoded inputs. 

To avoid output clipping (recommended) 

Input AMR-WB+ bitstream file (according to the -ff argument ) 

Output audio WAV file 

Frame erasure file for simulating frame erasures. 



The output sampling rate can be chosen in function of the hardware. The decoder is able to support 8, 16, 24, 32 and 
48kHz or 1 1.025, 22.050 and 44.1kHz. The sampling rate cannot be changed at run time. The default value is 48 kHz 
which is recommended. If the decoder receives modes to 15 the default sampling frequency is 16 kHz. 

The limiter option will increase quality when the signal contains saturations and will not degrade otherwise. It is 
recommended to always activate the limiter option. 

Frame erasure text files can be used for simulating frame erasures. They are constructed as text files (ASCII) with one 
flag "0" or "1" per line. There is one line per codec (transmission) frame. "0" indicates proper reception of the frame and 
"1" a frame erasure. 
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